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Abstract. Given n real numbers < xi,...,x n < 1 and a permutation a 
of {1, . . . , n}, we can always find xi, . . . , x n 6 {0, 1} so that the partial sums 
x~\ H — • + Xk and x a \ + ■ ■ ■ + x a k differ from the unrounded values x\ + • — |- x& 
and x a \ + ■ ■ ■ + x a k by at most n/(n + 1), for 1 < k < n. The latter bound is 
best possible. The proof uses an elementary argument about flows in a certain 
network, and leads to a simple algorithm that finds an optimum way to round. 

Many combinatorial optimization problems in integers can be solved or approximately 
solved by first obtaining a real- valued solution and then rounding to integer values. Spencer 
[11] proved that it is always possible to do the rounding so that partial sums in two 
independent orderings are properly rounded. His proof was indirect — a corollary of more 
general results [7] about discrepancies of set systems — and it guaranteed only that the 
rounded partial sums would differ by at most 1 — 2 -2 from the unrounded values. The 
purpose of this note is to give a more direct proof, which leads to a sharper result. 

Let x±, . . . , x n be real numbers and let a be a permutation of {1, ... , n}. We will write 

Sk = X! H hx fc , E k = x al -] \-x ak , 0</c<n, 

for the partial sums in two independent orderings. Our goal is to find integers X\, . . . ,x n 
such that 

[x k \ <x k < \x k ] , 
and such that the rounded partial sums 

S k =Xi-\ hlfc, Sk = %al H VXok 

also satisfy 

[s k \ <s k < \s k ] , [S k \ <Z k < \Sk\ , (*) 

for < k < n. Such will be called a two-way rounding of with respect 

to a. 

Lemma. Two-way rounding is always possible. 

Proof. We can assume without loss of generality that S n = m is an integer, by adding 
an additional term and increasing n if necessary. We can also assume that < Xk < 1 for 
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all k. Construct a network with nodes {s, ai, . . . , a m , ui, . . . , u n , v±, . . . , v n , b±, ... , b m , t} 
and the following arcs:* 

s — > and &j — > t for 1 < j ; < m ; 

Uk ^ Vk f° r 1 < /c < n ; 

a,-->u fc if [j-l..j)n[5fc-i..5fc)^0; 

-> 6, if b' - 1 • • J) n [^fc-i • • z*>) + • 

Each arc has capacity 1. This network supports a natural flow of to units, if we send 1 unit 
through each arc s — ■> aj and &j — > t, and x k units through — > Vk', the flow in aj — > it^ is 
the measure of the interval [j — 1 . . j) fl [Sk-i ■ ■ Sk), and the flow in v a k — > &j is similar. 
Deleting the arcs s — > a^- defines a cut of capacity to, so this must be a minimum cut. 

Since the arc capacities are integers, the max-flow/min-cut theorem implies that this 
network supports an integer flow of m units. Let Xk be the amount that flows through 
Uk — > Vk, for 1 < k < n, in one such flow. Then Xk G {0,1}. If j = \Sk] we have 
Sk = %i + • — h Xfc = flow into {tti, . . . , it/-} < flow out of {ai, . . . , aj} = j, because all arcs 
ai — > it; for / < /c have i < j. If j = [^J then Sk — flow into {u\, . . . , w^} > flow out of 
{ai, . . . , aj} = j, because all arcs — > for z < j have I < k. A similar argument proves 
that L^fcJ < ^fc < r^fcl) hence (*) holds. □ 

Corollary. Given any fixed k, two-way rounding is possible with Xk = \xk \ , as well as 
with x k = [xk\ ■ 

Proof. We may assume as before that < Xk < 1. The construction in the lemma 
establishes a feasible flow of Xk units in the arc Uk — > i>fc. It is well known that the 
polytope of all feasible flows has vertices whose coordinates are integers (see, for example, 
Application 19.2 in Schrijver [10]). Therefore the arc Uk Vk is saturated in at least one 
maximum flow, and it carries no flow at all in at least one other. □ 

Incidentally, it is important to impose a capacity of 1 on the arcs Uk — > Vk in the 
construction of this proof. Otherwise we might get solutions in which Xk = 2. Condition 
(*) does not by itself imply that Xk < \xk~\ ° r that Xk > [xk\- 

Notice that (*) is equivalent to the conditions 

\S k -S k \<l and \ZJ k -~£ k \ <1, for < k < n , 

since S k and E k are integers. Let us say that two-way rounding has discrepancy bounded 
by 5 if \Sk — Sk\ < 8 and \E k — E k \ < 8 for all k. A slight extension of the construction 
in the lemma makes it possible to prove a stronger result: 



* Here and in the sequel [a . . b) denotes the half-open interval {x\a < x < b}. This 
notation, due independently to Hoare and Ramshaw, is recommended in [5]. 



2 



Theorem 1. If S n = m is an integer, the sequence (xi, . . . , x n ) can be two-way rounded 
with discrepancy hounded by (2m + l)/(2m + 2). 

Proof. We will prove that two-way rounding bounded by 5 is possible for all 5 > 
(2m + l)/(2m + 2). Only finitely many roundings exist, so the stated result follows by 
taking the limit as 5 decreases to (2m + 1)/ (2m + 2). 

The proof uses a network like that of the lemma, but we omit certain arcs that would 
lead to discrepancies near 1. More precisely, if e is any fixed positive number < l/(2m + 2), 
we have 

aj ^u k if [j - 1 + e . . j - e) n [S k -i ■ ■ S k ) ^ ; 
v ak ^bj if [j - 1 + e . . j - e) n [A-i • • 2 k ) ^ . 

We also allow these arcs to have infinite capacity. But the capacity of the "source" arcs 
s — > a,-, the "middle" arcs u k — > v k: and the "sink" arcs bj — > t remains 1. 

The minimum cut in this reduced network has size m. For if any m — 1 of the unit- 
capacity arcs are cut, we will prove that we can still connect s to t. Suppose we remove 
p source arcs, q middle arcs, and r sink arcs, where p+q + r = m— 1. We send 1 — 2e 
units of flow from s through each of the m — p remaining source arcs. From every cij 
reached in this way, we send as many units of flow from aj — > u k as the size of the interval 
[j — l + e . . j — e) fl [Sk-i ■ ■ S k ). Some of the flow now gets stuck, if u k is one of the q vertices 
for which the arc u k — > v k was removed. But at most 1 — 2e units flow into each u k , so we 
still have at least (m — p — q)(l — 2e) = (r + 1)(1 — 2e) units of flow arriving at {vi, . . . , v n }. 
Now consider an "antiflow" of 1 — 2e units from t back through each of the m — r remaining 
sink arcs bj — > t. From every such bj we send the antiflow back through v ak bj according 
to the size of [j — 1 + e . . j — e) fl [E k _\ . . E k ). In this way (m — r)(l — 2e) units of antiflow 
come from t to {vi, . . . , v n }. Each vertex v k contains at most x k units of flow and at most 
x k units of antiflow. We know that the total flow plus antiflow at {vi, . . . ,v n } is at least 

(r + 1)(1 - 2e) + (m - r)(l - 2e) = m + 1 - (2m + 2)e > m = xi H hi n . Therefore 

some vertex v k must contain both flow and antiflow. And this establishes the desired link 
between s and t. 

Since m is the size of a minimum cut and all capacities are integers, the network 
supports an integer flow of value m. Let x k be the flow from u k to v k ; we will prove that 
( xi, . . . , x n ) is a two-way rounding with discrepancy < 5 = 1 — e. Note that 

\S k - S k \ < 1 - e ^ [S k + e\ <S k < \S k - e] . 

If j = \S k — e] we have S k = x\ + ■ • ■ + x k = flow into {ui, . . . , u k } < flow out of 
{ai, . . . , aj} = j, because all arcs ai — > u\ for / < k have [i — 1 + e . . i — e)fl 5 1 /) 7^ 0, 

hence i — 1 + e < Si and z < \Si — e] < j. Similarly, if j = [S k + ej we have S k > flow 
out of {ai, . . . , aj} = j, because all arcs — * u\ for % < j have I < k. (If I > k we would 
have > S k > j — e > i — e, contradicting < z — e.) A similar proof shows that 
Lr fc + e J <T fc < pJ7 fc -e"|. □ 

The bound of Theorem 1 is, in fact, best possible, in the sense that no better bound 
can be guaranteed as a function of m. 
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Theorem 2. For all positive integers m there exists a sequence of real numbers (x\ , . . . , x n ) 
with sum m and a permutation a of {l,...,n} that cannot be two-way rounded with 
discrepancy < (2m + 1)/ (2m + 2). 

Proof. Let n = 2m + 2 and e = 1/n. Define 

xi = x 2 = x 3 = e ; x m+3 = (2m - l)e ; 

x k +3 = 2e , Xk+m+3 = 2me , for 1 < k < m ; 

al = 2 , a-2 = 1 , a3 = m + 3 , a(2m + 2) = 3 ; 

a(2k + 2) = fc + 3, <r(2fc + 3) = /c + m + 3 , for l<k<m. 

For example, when m = 4 we have (xi, . . . , x\q) = (.1, .1, .1, .2, .2, .2, .7, .8, .8, .8) and 
(al, . . . , o-10) = (2, 1, 7, 4, 8, 5, 9, 6, 10, 3). Hence 

(Si, . . . , Sio) = (.1, -2, .3, .5, .7, .9, 1.6, 2.4, 3.2, 4.0) , 
(A, . . . , r 10 ) = (.1, .2, .9, 1.1, 1.9, 2.1, 2.9, 3.1, 3.9, 4.0) . 

We will prove that this sequence and permutation cannot be two-way rounded with dis- 
crepancy less than (2m + l)/(2m + 2) = 0.9; the same proof technique will work for any 
m > 1. 

The main point is that whenever Sk or Uk has the form / ± 0.1 where / is an integer, 
it must be rounded to / in order to keep the discrepancy small. This forces Si = U± = 0, 
U 3 = U4 = 1, U 5 = Uq = 2, Uf = U 8 = 3, Ug = 4, hence x\ = x 2 = £3 = £4 = X5 = 
xq = 0. But then S 6 = x~i + h x 6 = differs by 0.9 from Sq. □ 

Although Theorem 2 proves that Theorem 1 is "optimal," we can do still better if m 
is greater than |n, because we can replace each Xk by 1 — Xk- This replaces m by n — m, 
and the bound on discrepancy decreases to (2n — 2m + l)/(2n — 2m + 2). Then we can 
restore the original Xk and change xu to 1 — xu- This computation preserves \Sk — Sk\ and 
\Uk — 2Jk\i so it preserves the discrepancy. 

Further improvement is also possible when m = \nj 2 J, if we look at the construc- 
tion closely. The following theorem gives a uniform bound in terms of n, without any 
assumption about the value of x\ + • • • + x n . 

Theorem 3. Any sequence (x±, . . . , ^Cfl^ Sill d permutation (al, . . . , an) can be two-way 
rounded with discrepancy bounded by n/(n + 1). 

Proof. We will show in fact that the discrepancy can always be bounded by (n — l)/n, 
when x\ H — ■ + x n = m is an integer. The general case follows from this special case if we 
set x n +i = \S n ~\ — S n and increase n by 1. 

If 2m + 2 < n or 2n — 2m + 2 < n, the result follows from Theorem 1 and possible 
complementation. Therefore we need only show that a discrepancy of at most (n — l)/n 
is achievable when m = [n/2\. 
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Consider first the case n = 2m + 1. We use the network in the proof of Theorem 1, 
but now we allow e to be any number < 1/n. Suppose, as in the former proof, that we 
can disconnect s from t by deleting p source arcs, q middle arcs, and r sink arcs, where 
p + q + r = m — 1. Let q be minimum over all such ways to disconnect the network. We 
construct flows and antiflows as before, and we say that Xk is green if Vk contains positive 
flow, red if Vk contains positive antiflow. No Xk is both green and red, since there is no 
path from s to t. The previous proof showed that there are at least (r + 1)(1 — 2e) units of 
green flow and (m — r) (1 — 2e) units of red flow, hence there are at least m + 1 — (2m + 2)e 
units of flow altogether. If we can raise this lower bound by e, we will have a contradiction, 
because m + 1 — (2m + l)e > m. 

Suppose q > 0, and let Uk — > Vk be a middle arc that was deleted. At most two arcs 
emanate from Vk in the network. Since q is minimum, there must in fact be two; otherwise 
we could restore Uk — * Vk and delete a non-middle arc. The two arcs from Vk must be 
consecutive, from Vk — > bj and Vk — > fy+i, say. Furthermore the arcs bj — ■> t and fy+i — > t 
have not been cut. If /c = a/ we have < j — e and £i > j + e. Our lower bound on 
antiflow can now be raised by 2e, because it was based on the weak assumption that no 
antiflow runs back from [j — e . . j + e). This improved lower bound leads to a contradiction; 
hence q = 0. 

Divide the interval [0 . . m) into 3m regions, namely "tiny left" regions of the form 
[j — 1 . . j — 1 + e) , "inner" regions of the form [j — 1 + e . . j — e), and "tiny right" regions 
of the form [j — e . . j), for 1 < j < m. If we color the points of [Sk-i ■ ■ Sk) with the 
color of Xk, our lower bound (r + 1)(1 — 2e) for green flow was essentially obtained by 
noting that m — p = r + 1 of the inner regions are purely green. Similarly, if we color the 
ponts of [Sk-i ■ ■ ^k) with the color of x a k, our lower bound for red flow was obtained by 
noting that m — r = p + 1 inner regions in this second coloring are purely red. Notice that 
there is complete symmetry between red and green, because we can invert the network and 
replace a by a -1 . 

Call an element Xk large if it exceeds 1 — e. If any Xk is large, the interval [S^-i • • Sk) 
occupies more than e units outside of an inner region; this allows us to raise the lower 
bound by e and obtain a contradiction. Therefore no element is large. It follows that no 
element Xk can intersect more than 2 tiny regions, when Xk is placed in correspondence 
with [S k -i ■ ■ S k ) or with [U k -i ■ ■ A)- 

Let's look now at the 2m tiny regions. Each of them must contain at least some red in 
the first coloring; otherwise we would have at least (p+ 1)(1 — 2e) red units packed into at 
most 2m — 1 tiny regions and p inner regions, hence (p + 1)(1 — 2e) < (2m — l)e +p(l — 2e), 
contradicting e < 1/n. This means there must be at least m + 1 red elements Xk, since no 
red element is large and since m non-large red intervals can intersect all the tiny regions 
only if they also cover all the inner regions (at least one of which is green). Similarly, there 
must be at least m + 1 green elements. But this is impossible, since there are only 2m + 1 
elements altogether. Therefore the network has minimum cut size m, and the rest of the 
proof of Theorem 1 goes through as before. 

Now suppose n = 2m. Then we can carry out a similar argument, but we need to 
raise the lower bound by 2e. Again we can assume that q = 0. We can also show without 
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difficulty that there cannot be two large elements. When n = 2m the argument given 
above shows that at least 2m — 1 of the tiny regions must contain some red, in the first 
coloring. 

Suppose there are only m — 1 red elements. Then, in the first coloring, m — 2 of them 
intersect 2 tiny intervals and the other is large and intersects 3; we have raised the red 
lower bound by e. But (p + 1)(1 — 2e) + e red units cannot be packed into 2m — 1 tiny 
regions and p inner regions, because (p + 1)(1 — 2e) + e > (n — l)e + p(l — 2e). 

A symmetrical argument shows that there cannot be only m — 1 green elements. 
Therefore exactly m elements are red and exactly m are green. Suppose no element is 
large. Then we have at least one purely green tiny interval in the first coloring and at least 
one purely red tiny interval in the second — another contradiction. Thus, we may assume 
that there is one large red element, and that the 2m tiny intervals in the first coloring 
contain a total of less than e units of green. In particular, each of them contains some red. 
Either the first interval [0 . . e) or the last interval [m — e . . m) is intersected by a non-large 
red element, which intersects at most e units of space in tiny intervals. The other m — 1 
red elements intersect at most 2e units of tiny space each, so at most (2m — l)e such units 
are red. This final contradiction completes the proof. □ 

The result of Theorem 3 is best possible, because we can easily prove (as in Theorem 2) 
that the values 

1 ( (n — l)/(n + 1) , k even, 2 < k < n 

Xl ~ n+ 1 ' Xk ~ I 2/(n+ 1), /codd, 3<k<n 

and a "shuffle" permutation that begins 

f 2k - 1 for 1< 2k - 1< n, n odd 
ok = { 

{ 2k for 1 < 2k < n, n even 

cannot be two-way rounded with discrepancy less than n/(n+ 1). 

So far we have discussed only worst-case bounds. But a particular two-way rounding 
problem, defined by values (x±, . . . , x n ) and a permutation (al, . . . , an), will usually be 
solvable with smaller discrepancy than guaranteed by Theorems 1 and 3. A closer look at 
the construction of Theorem 1 leads to an efficient algorithm that finds the best possible 
discrepancy in any given case. 

Theorem 4. Let e be any positive number. There exists a solution with discrepancy less 
than 1 — e to a given two-way rounding problem if and only if the network constructed in 
the proof of Theorem 1 supports an integer flow of value m. 

Proof. The final paragraph in the proof of Theorem 1 demonstrates the "if" half. 
Conversely, suppose xi,...,x n is a solution with discrepancy < 1 — e. If x^ = 1, let 
j = Sk- Then j — 1 = Sk-i, so the condition l^-i — Sk-i\ < 1 — e implies Sk-i < j — e. 
Also \Sk — Sk | < 1 — e implies Sk > j — I + e. Therefore there is an arc cij — > Uk- Similarly, 
there is an arc iVfc — > bj when x a k = 1 and j = Sk- So the network supports an integer 
flow of value m. □ 
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In other words, the optimum discrepancy 5 = 1 — e is obtained when e is just large 
enough to reduce the network to the point where no m-unit flow can be sustained, if S > |. 
We can in fact find an optimum rounding as follows: Let 

f(j, k) = min(j - S k -i, S k - j + 1) 

be the "desirability" of the arc cij — > u k , and 

g(j, <rk) = mm(j - E k -i, S k - j + 1) 

the desirability of v ak — > bj. (Thus the arcs cij — > u CT A: — > bj are included in the 
network of Theorem 1 if and only if their desirability is greater than e.) Sort these arcs by 
desirability, and add them one by one to the initial arcs {s — > a,j,u k — > v k ,bj — > t} until 
an integer flow of m units is possible. Then let x k be the flow in u k v k , for all k; this 
flow has discrepancy equal to 1 minus the desirability of the last arc added, and no smaller 
discrepancy is possible. 

Notice that the arc cij — > u k has desirability > ^ if and only if S k -i < j — ^ < S k , so 
at most m such arcs are present. If all x k lie between and 1, at most m + n — 1 arcs of 
the form a,j — > u k will have positive desirability, since both cij-\ — > -u^ and cij — >■ -u^ will 
be desirable iff S^-i < j < S k . 

The following simple algorithm turns out to be quite efficient, assuming that m < \n\ 
Begin with the network consisting of arcs {s — > a,j,u k — > v k ,bj — > t} for 1 < j < m 
and 1 < /c < n, plus any additional arcs of desirability > |. Call an arc a,j — > or 
^crfc ~~ 6j "special" if its desirability lies between 1/ min(2m + 2,n) and |, inclusive; fewer 
than 2m + 2n arcs are special. Then, for j = 1, . . . , m, send one unit of flow from a,j to t 
along an "augmenting path," using the well-known algorithm of Ford and Fulkerson [2, 
pp. 17-19] but specialized for unit-capacity arcs. In other words, construct a breadth-first 
search tree from a,j until encountering t; then choose a path from a,j to t and reverse the 
orientation of all arcs on that path. If t is not reachable from a,j, add special arcs to the 
network, in order of decreasing desirability, until t is reachable. 

The running time of this algorithm is bounded by 0{mn) steps, but in practice it runs 
much faster on random data. For example, Tables 1 and 2 show the results of various tests 
when the input permutation a is random and when the values (x\, . . . , x n ) are selected as 
follows: Let yi,...,y n be independent uniform integers in the range 1 < y k < N, where N 
is a large integer (chosen so that arithmetic computations will not exceed 31 bits). Increase 
one or more of the y's by 1, if necessary, until y\ + • • • + y n is a multiple of m; then set 
Xk = yk/d, where d = (y\ + • • • + y n )/m. Reject (xi, . . . , x n ) and start over, if some x k > 1. 
(In practice, rejection occurs about half the time when m = ^n, but almost never when 
m < \n.) 

Table 1 shows the optimum discrepancies found, and Table 2 shows the running time 
in memory references or "mems" [6, pp. 464-465] divided by n. All entries in these tables 
are given in the form \x ± cr, where // is the sample mean and a is an estimate of the 
standard deviation; more precisely, a is the square root of an unbiased estimate of the 
variance. The number of test runs t(n) for each experiment was 10 6 /n; thus, 10 5 runs 
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were made for each m when n = 10, but only 10 runs were made for each m when n = 
10 5 . The actual confidence interval for the tabulated \i values is therefore approximately 
2a/y/i{n) = .002a^/n. 
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Table 1. Empirical optimum discrepancies 







m = 1 


m = 2 


m = \ lgn\ 


m = \_y/n\ 


m = |n 


n = 


10 


.566 ±.06 


.619 ± .07 


.627 ± .07 


.627 ± .07 


.622 ± .08 


n = 


100 


.537 ±.02 


.575 ± .03 


.664 ± .03 


.710 ± .03 


.759 ± .02 


n = 


1000 


.513 ±.007 


.527 ± .01 


.582 ± .01 


.662 ± .02 


.794 ± .02 


n = 


10000 


.504 ± .002 


.509 ± .003 


.535 ± .005 


.612 ± .01 


.818 ± .01 


n = 


100000 


.502 ± .001 


.503 ± .001 


.513 ± .002 


.570 ± .005 


.838 ± .007 



Table 2. Empirical running time, in mems/n 







m = 1 


m = 2 


m = [lgn\ 


m = \yjn\ 


m = \n 


n = 


10 


10 ±4 


19 ±6 


27 ±8 


27 ±8 


37 ± 11 


n = 


100 


2.9 ± 1.3 


6±2 


18 ±5 


29 ±7 


76 ± 15 


n = 


1000 


0.9 ±0.5 


1.9 ±0.7 


8.5 ±2.2 


25 ±6 


152 ±32 


n = 


10000 


0.3 ±0.2 


0.6 ±0.2 


3.6 ±0.8 


22 ±7 


289 ± 49 


n = 


100000 


0.1 ±0.1 


0.2 ±0.1 


1.4 ±0.4 


17 ±4 


540 ± 72 



Notice that when m<n, the optimum discrepancy is nearly |. Indeed, this is obvious 
on intuitive grounds: When n is large, approximately en values of k will have Sk within 
|e of {| , !,..., m — |}, and approximately e 2 n will also have equally good values E a -i k . 
So we are essentially looking for a perfect matching in a bipartite graph with m vertices 
in each part and e 2 n edges. For fixed masrn oo, the matching will exist when e 2 n is 
sufficiently large, hence the mean optimum discrepancy is ^ ± 0{n~^). 

However, the behavior of the mean optimum discrepancy when m = \n is not clear. 
It appears to approach 1, but quite slowly, perhaps as 1 — c/logn. 

When n is fixed and m varies, the mean optimum discrepancy is not maximized when 
m = \n. For example, when n = 10, Table 1 shows that it is .622 when m = 5 but .627 
when m = 3. 

The running times shown in Table 2 do not include the work of constructing the net- 
work or sorting the special arcs by desirability. Those operations are easily analyzed, and 
in practice they take am + bn steps for some constants a and b, because a straightforward 
bucket sort is satisfactory for this application. Therefore only the running time of the 
subsequent flow calculations is of interest. 

The average running time to compute the flows appears to be o(n) when m < \fn, 
and approximately proportional to n 13 when m = \n. So it is much less than the obvious 
upper bound ran of the Ford-Fulkerson scheme. The author tried to obtain still faster 
results by using more sophisticated max-flow algorithms, but these "improved" algorithms 
actually turned out to run more than an order of magnitude slower. 

For example, the algorithm of Dinits, as improved by Karzanov and others, seems at 
first to be especially well suited to this application because the network of Theorem 1 is 
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"simple" in the sense discussed by Papadimitriou and Steiglitz [9, pp. 212-214]: Every in- 
ternal vertex has in-degree 1 or out-degree 1, hence edge-disjoint paths are vertex-disjoint 
and the running time with unit-capacity arcs is 0( |U| 1//2 \A\ ) = 0(n 3 / 2 ). Using binary 
search to find the optimum number of special arcs gives us a guaranteed worst-case per- 
formance of 0(min(m, n 1//2 )n logn). Unfortunately, in practice the performance of that 
algorithm actually matches this worst-case estimate, even on random data. For example, 
when m = \n the observed running time in mems/n was 15284 ± 2455 when n = 10 4 , 
compared to 289 ± 49 by the simple algorithm. Each flow calculation consumed more than 
lOOOn mems, and binary search required |Tg 2n\ = 14 flow calculations to be carried out. 

When modern preflow push/relabel algorithms are specialized to unit-capacity net- 
works of the type considered here, they behave essentially like the Dinits algorithm and 
are no easier to implement (see Goldberg, Plotkin, and Vaidya [4]). Such algorithms do 
allow networks to change dynamically by adding arcs from s and/or deleting arcs to t (see 
Gallo, Grigoriadis, and Tarjan [3]); but our application requires adding or deleting special 
arcs in the middle of the network, so the techniques of [3] do not apply. Thus the simple 
Ford-Fulkerson algorithm seems to be a clear winner for this application, in spite of a lack 
of performance guarantees. 

How complex can the networks of Theorem 1 be? If we have any bipartite graph 
with m vertices in each part and with n edges, and if every edge can be extended to a 
perfect matching, then we can find real numbers ( ) in the range < Xk < 1 and 

a permutation (crl, . . . , an) such that x± + • — h x n = m and the two-way roundings are in 
one-to-one correspondence with the perfect matchings of the given graph. For we can take 
(xi, . . . , x n ) = tiCKi + • — h t n a n where t± + • — h t n = 1 and otk is the characteristic vector 
of a perfect matching that uses edge k. The sum of Xk over all the edges touching any 
vertex is 1. Represent an edge from u to v by the ordered pair (u, v), and label the edges 
1, . . . , n in lexicographic order of these pairs; then define the permutation crl, . . . , an by 
lexicographic order of the dual pairs (v, u). It follows that if k is the final edge for vertex j 
in the first part, we have Sk = j] and if ak is the final edge for vertex j in the second part, 
we have = j- The correspondence between matchings and roundings is now evident. 

This construction shows that the networks arising in Theorem 1 are general enough to 
mimic the networks that arise in bipartite matching problems, but only when the bipartite 
graphs contain no unmatchable edges; and the corollary preceding Theorem 1 shows that 
the latter restriction cannot be removed. This restriction on network complexity might 
account for the excellent performance we obtain with the simple Ford-Fulkerson algorithm. 

If the capacity constraint on uu — * vu is removed, our network becomes equivalent to 
a network for bipartite matching, in which we want to match {ai, . . . , a m } to {&i, . . . , b m } 
through edges cij — by whenever cij — > Uk and Vk — > by. The problem of finding the best 
such match, when the edge a,j — by is ranked by the minimum of the desirabilities f(j, k) 
and g(j', k), is then a bottleneck assignment problem [1, 2]. (Open question: Is there a nice 
way to characterize all bottleneck assignment problems that arise from two-way rounding 
problems in this manner?) 

The problem of optimum two-way rounding is, however, more general than the bottle- 
neck assignment problem, because the unit capacity constraint on uu — * Vk is significant. 
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Consider, for example, the case n = 7, m = 3, (xi, . . . , X7) = ^(8,8,24,11,11,11,11), 
(al,...,a7) = (2,1,3,5,4,7,6). Then {S 1 , . . . , S 7 ) = {E U ...,U 7 ) = ^(8,16,40,51,62, 
73, 84), and the arcs {cij — > Uk, Vk — > ranked by desirability are 



^6, ^7 — > ^3 desirability = min( || , || ) = || 

ui — > 6i, a 2 — > «4, u 5 — > 62 desirability || 

"3> ^3 — >■ 61 j ^2 — >■ w 3 , d 3 — > b 2 desirability |§ 

tt7, t>g — > 63 desirability || 

ui, i>2 — ► &i desirability ^ 

M5, ^4 — > 63 desirability ^ 

•U5, — * 62 desirability ^ 



a 3 
ai 
ai 
a 3 
ai 

a 3 -> it 5 , v 4 



a 2 

Thus the edges a,- — &,/ ranked by desirability are 

ai — 61, ai — 6 2 , a2 — h, a2 — ^2 ( |§ via it 3 , u 3 ) 

a 3 — 63 ( || via u 6 ,v 6 or ^7,^7) 

ai — bi ( ^ via tti, i>i or u 2 , ^2) 

a 2 — 63, a 3 — 6 2 ( ^ via u 4 , v A or it 5 , v 5 ) 

a 2 — b 2 ( 5g via it 4 , i> 4 or it 5 , i> 5 ) 

The bottleneck assignment problem is solved by matching ai — 61, a 2 — b 2 , and 03 — 63 
with desirability min ( || , || , || ) = ||. But this matching does not correspond to a valid 
two-way rounding because it uses the intermediate arc 113 — > u 3 twice; it rounds x 3 to 2 
and ^6 (or £7) to 1. The optimum two-way rounding uses another route from a± to b± and 
has desirability min ( ^ , |§ , || ) = discrepancy 1 — ^ = 2§; it rounds xi (or X2), x 3 , 
and xq (or X7) to 1, the other x's to 0. 

In closing, we note that a conjecture of Jozsef Beck [7, 11] remains a fascinating open 
problem: Is there a constant K such that three-way rounding is always possible with 
discrepancy at most Kl (in three-way rounding the partial sums are supposed to be well 
approximated with respect to a third permutation (rl, . . . , rn), in addition to (1, . . . , n) 
and (crl, . . . , an).) It suffices [7, 11] to prove this when Xk = \ for all k. 

Can any of the methods of this paper be extended to find better bounds on the 
discrepancy of arbitrary set systems (or at least of set systems more general than those for 
two-way rounding), in the sense of [11]? 



Acknowledgments. I wish to thank Joel Spencer for proposing the problem and for 
showing me a simple construction that forces discrepancy nj (n+ 1). Thanks also to Noga 
Alon, Svante Janson, and Serge Plotkin for several stimulating discussions as I was working 
out the solution described above. Shortly after I had proved Theorems 1-3, a somewhat 
similar construction was found independently by Jacek Ossowski, who described it in terms 
of common systems of distinct representatives instead of network flows; see §9.2 in [8]. 
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